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Abstract: We study the possibility to separate in gluon fusion loop-induced Higgs boson 
production from point-like production. The Higgs boson is reconstructed in the H —> 77 
final state at very large transverse momentum. Using the Higgs boson yields (normalized to 
the overall rate) and the shape of the Higgs boson p± distribution the two hypotheses can 
be separated with 2 standard deviations with an integrated luminosity of about 500fb _1 . 
The largest experimental uncertainty affecting this estimate is the background event yield. 
The theoretical uncertainties from missing top mass effects are large, but can be decreased 
with dedicated calculations. 
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1 Introduction 

Since the discovery of a Higgs-like state at the LHC [1] huge efforts have been made to study 
its properties in detail [2] in order to test consistency with the predictions of a Standard 
Model (SM) Higgs particle [3]. Its couplings to vector bosons and fermions agree with SM 
expectations within errors. The present state of the analyses at the LHC underline the CP- 
even and spinless nature of the discovered particle. The dominant Higgs boson production 
mechanism within the SM is the loop-induced gluon-fusion process gg —>• H that is mediated 
by top and to a lesser extent bottom triangle loops as exemplified in Fig. 1. Although the 
total inclusive production rate agrees with the SM prediction within uncertainties, it is not 
clear that this production process is indeed loop-induced. For an experimental proof of the 
loop nature of the gluon-fusion process the top mass effects of the formfactors describing 
the Higgs coupling to gluons have to be measured. A point-like Higgs coupling to gluons is 
defined in terms of the effective Lagrangian 

n H 

H eff = c g ^G^G%- ( 1 . 1 ) 

with a s denoting the strong coupling constant, G a ' 111 ' the gluon held strength tensor, H the 
Higgs held and v the electroweak vacuum expectation value. The Wilson coefficient c g is 
adjusted to reproduce the measured inclusive cross section. In order to distinguish between 
a point-like Higgs coupling to gluons and the loop-induced type a large energy scale has to 
be inserted into the loop. At LO the gluon fusion process proceeds at the hxed scale given 
by the Higgs mass. The first distribution that gives rise to a variable energy scale inside 
the loops is the transverse momentum distribution of the Higgs particle that is generated 
primarily by the additional radiation of a gluon at LO (see Fig. 2). The comparison of the 
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Pt distribution at small and large values of pr allows for a distinction between top mass 
effects due to the loop contributions and a top mass independent point-like Higgs coupling 
to gluons [4], First studies of the prospects at the LHC to resolve the Higgs coupling to 
gluons have been made recently [5]. These studies, however, are all theory-based while a 
rigorous experimental study is missing so far. The purpose of this letter is to describe and 
present the results of our analysis of the pp spectrum of the Higgs particle with a detailed 
inclusion of experimental effects. In this way our study goes significantly beyond those of 
Refs. [5]. 



Figure 1. 


Gluon fusion gg —¥ H at leading order mediated by top and bottom triangle loops. 





Figure 2. Generic diagrams for Higgs production in association with a jet via gluon fusion at 
leading order mediated by top and bottom triangle loops generated by gg, gq, qq initial states. 

The NLO corrections to the p-p-distribution are only known in the limit of a heavy 
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top quark [ 6 ] supplemented by subleading terms in the inverse top mass at NLO [7]. As 
for the inclusive cross section the QCD corrections are large and positive. Recently the 
NNLO QCD corrections to the px distribution have been derived in the heavy top limit 
yielding a further moderate increase of ~ 30% [ 8 ], thus corroborating a reliable perturbative 
behaviour. 

Since the pure LO and NLO results diverge for px —>• 0, the small px region requires 
a soft gluon resummation for a reliable prediction. This resummation has been performed 
systematically for the top quark loops in Ref. [9], neglecting finite top mass effects at NLO. 
Soft gluon effects factorize, so that the top mass effects at small px are well approximated 
by the LO mass dependence for small Higgs masses [10-12], Since the top-loop contribu¬ 
tion dominates the cross section for the SM Higgs boson, the only limiting factor of the 
NLO+NNLL result as implemented in HqT or HRes [13] is thus the heavy-top approximation 
of the NLO corrections which affects the whole px range for large Higgs masses and the 
large px region in particular for all Higgs masses. It has been shown that the subleading 
NLO terms in the inverse top mass affect the px distribution by less than 10% for px < 300 
GeV, if the full LO mass dependence is taken into account [7]. 

In addition to the leading top-quark contribution the bottom quark loops amount 
to about — 6 % in the inclusive cross section [14]. However, at leading order the bottom 
quark contribution is only sizable for small px, while for larger px values it can safely 
be neglected. Recently, bottom quark contributions have been included in the predictions 
for the resummed p-p-distributions [10-12, 15-18]. However, soft-gluon-resummation and 
bottom-quark effects do not play a role in our analysis that just concentrates on the shape 
at large px values, where the resummed predictions coincide with the fixed-order results. 

2 Simulation setup 

The focus of this study is on Higgs boson decays to two photons, as this final state provides 
a reasonably large signal-to-background ( S/B) ratio combined with a fair Higgs boson 
transverse momentum (p±) resolution. The alternative decay H —> ZZ^*> —> A£(l = 
e, p) would offer superior S/B and p± resolution, but suffers from even smaller statistics 
due to the very small effective branching fraction. Signal (background) event samples 
corresponding to proton-proton collisions at ^/s = 14TeV are produced with NLO (LO) 
event generators. Subsequently these data are passed through a parametrized detector 
simulation. 

The signal events H — > 77 are generated with MCQNLO 4.10 [19, 20] with a dynamic 
scale of p = 2 \J rn H + Rl 2 > where rrin = 125 GeV is the Higgs boson mass. For the loop- 
induced process, the exact top and bottom quark mass dependence is taken into account 
following the two-step procedure advocated in Ref. [15]. The events are hadronized with 
Herwig++ [21] using the cluster model of [22]; the shower evolution is performed with the 
algorithm described in [23]. For systematic studies, the QCD renormalization px and 
factorization pp scales are varied according to the recipe provided in [24], i.e., from 0.5 
to 2.0 times the nominal value p, excluding Pr/Pf = 1/4 and 4. This variation not only 
changes the total production cross section, but also affects the shape of the Higgs boson 
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high-p_|_ distribution. The MSTW08 [25] set of parton distribution functions (PDF) is used 
as default, while for systematic studies, NNPDF 2.3 [26] and CT10 [27] are considered. The 
total gluon fusion cross section for Higgs boson production is normalized to er = 49.5 pb with 
the uncertainties mentioned in Ref. [24], Eventually, the total gluon fusion cross section will 
be measured with a substantially better experimental precision. The branching fraction for 
the considered final state is B(H —>• 77 ) = 2.28 X 1CV 3 , with a relative uncertainty of 5% 
[28] which has been obtained from combining results of Hdecay [29] and Prophecy4f [30]. 

Using Powheg [31] to simulate the two signal Higgs boson scenarios instead of MC0NL0 
does not change the conclusion of this paper in a significant way. 

The diphoton background events are generated with Sherpa VI.4.0 [32] with leading- 
order MSTW08 PDFs. In addition to the two photons, one or two jets are allowed in the 
process. Three requirements are applied at the generator level: (1) the generated photon 
energy is required to be E > 30GeV, (2) the invariant mass of the two photons must be 
in the range 50 < m 77 < 200 GeV and (3) the diphoton p_i has to fulfill p± > 130 GeV. In 
this kinematic region, the total cross section determined by Sherpa is Otot = 1.6 pb. We 
assume an uncertainty of 40% on this cross section in the high-pj_ region. 

The diphoton p± distribution has been measured by ATLAS and CMS in pp collisions 
at yfs = 7 TeV. ATLAS showed in Ref. [33] that for p± > 300 GeV the diphoton sample is 
dominated by two real photons and that contributions from one or two jets misidentified 
as photons are roughly at the 10% level. The high-p_|_ data are very well described in 
shape by the Sherpa (and Pythia) MC simulations. For the rate, the ATLAS collaboration 
scaled the expectations of Sherpa 1.3.1 up by 20%. The CMS collaboration [34] found very 
good agreement of Sherpa 1.4.0 (the same version is used for the present analysis) with 
the differential and integrated diphoton cross sections, albeit restricted to somewhat lower 
diphoton p±. 

The detector response simulation is based on Delphes 3 [35]. The default card for 
the CMS detector, included in the Delphes 3 distribution, is used. One modification is 
applied to the card: the cone for the calculation of photon isolation is reduced from A R = 
y/ A 7 2 + A cj) 2 = 0.5 to A R = 0.1. The original setting leads to a noticeable inefficiency for 
Higgs bosons measurements at very large p±, as the two photons are reconstructed close to 
each other. The performance of the ATLAS detector would likely be comparable, given the 
very similar performance of CMS and ATLAS in the Higgs boson measurements to date. 

A significant limitation at the high-luminosity LHC (HL-LHC) will be the pile-up, 
currently estimated at around 140 overlapping events per bunch crossing. At present there 
is no complete understanding on the exact details of photon measurement and identification 
performance for that LHC and detector configuration. The current ‘particle-flow’ approach 
(also implemented in Delphes 3) is likely to be replaced by more powerful algorithms, 
which will allow for much better mitigation of pile-up effects than the present photon 
reconstruction. For the time being, we do not include pile-up in the detector simulation 
and account for this with a large selection efficiency uncertainty. It should be stressed that 
in this analysis no jet information is used, the only ingredients are high-p_|_ photons. 

We assume furthermore that events with H — > 77 candidates at high p± can be trig¬ 
gered with a negligible inefficiency. 
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3 Analysis 


The purpose of this analysis is an estimate of the expected sensitivity to discriminate be¬ 
tween the loop-induced and a point-like Higgs boson coupling. For this purpose, Higgs 
boson samples corresponding to the two hypotheses combined with diphoton background 
samples are studied. The invariant mass distribution provides the primary means to sep¬ 
arate the Higgs boson signal from the diphoton background. As the expected number of 
Higgs bosons in the high-pj_ region differs between the two hypotheses, the invariant mass 
distribution also provides a powerful handle to distinguish between the two hypotheses. In 
addition the p± distribution, where the two hypotheses lead to different shapes, provides 
additional separation power as illustrated in Fig. 3 (left). 



Figure 3. Differential production cross section as generated (left) and reconstructed after all 
analysis requirements (right). The effective cross section includes B(H —>• 77 ) for the Higgs boson 
cross sections (left and right), and the analysis efficiency for all cross sections (right). The error 
bars show the statistical uncertainty only. 

The data analysis uses reconstructed objects from the Delphes 3 detector simulation. 
No attempt is made to unfold the true Higgs boson p_\_ spectrum from the ‘reconstructed’ 
spectrum. Rather the ‘reconstructed’ pj_-spectra from Higgs boson signal hypotheses and 
diphoton background are used to fit the combined ‘data’ spectra. 

3.1 Event Reconstruction 

Reconstructed and identified photons from Delphes 3 are subject to additional require¬ 
ments. The isolation requirement of each photon is further tightened: the maximum en¬ 
ergy flow in the form of tracks or energy clusters (except for the other photon forming the 
Higgs boson candidate) allowed within a cone of A R < 0.3 around the photon momentum 
is required to have p± < 0.5 GeV. The efficiency of this requirement is 52% for signal 
events and 17% for background events, nearly independent of the diphoton p± requirement. 
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This requirement is quite strict, but not inconsistent with present capabilities [36] using 
area-based energy subtraction procedures [37]. As mentioned above, an improved pileup 
treatment will be required for the successful usage of the photon isolation variable at the 
HL-LHC. 

The two photons with the largest p± are combined to form Higgs boson candidates. 
The diphoton invariant mass must fulfill 70 < m 77 < 180 GeV. The leading (subleading) 
photon has to fulfill Ep > 80(50) GeV and for both photons 1 77 ! < 2.5 is required. No 
jet information is used in this analysis, even though implicitly there will be jets as we 
require for the diphoton p± > 300 GeV (this will be referred to as high-pj_ region). For this 
kinematic region the signal (background) selection efficiency is 32.3% (3.4%). The values 
for the single and diphoton p± requirements were optimized by maximizing the expected 
hypothesis discrimination sensitivity. 

In Fig. 3 (right) the effective differential cross section for signal and background after the 
event selection is shown. The background contribution is found to be very large even after 
the above selection. The average expected diphoton invariant mass and pj_ distributions for 
an integrated luminosity of 1000 fb -1 are shown in Fig. 4, where signal and background have 
been combined (in contrast to Fig. 3, where signal and background are shown separately). 
The Higgs boson invariant mass resolution depends strongly on the diphoton p± and is 
limited by the angular resolution in the kinematic range of this analysis. 




P T [GeV] 


Figure 4. Average expected event yields vs diphoton invariant mass (left) and diphoton p j_ (right), 
normalized to 1000 fb -1 . The histograms contain the background plus the point-like Higgs produc¬ 
tion (open triangles) and the background plus the loop-induced Higgs production (open circles). 
The background component is identical for both hypotheses. The error bars show the statistical 
uncertainty only. 

The invariant mass distribution for the signal is well described by a single Gaussian with 
standard deviation a = 9.4 GeV in the high-pj_ region. The background is parametrized 
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with a first degree polynomial. The rising background invariant mass shape arises from 
the selection criteria applied, in particular the very high-pj_ requirement and the photon 
isolation criteria. In the high-p_|_ region, the shape of the signal and background p± distri¬ 
butions can be described with a log normal distribution. The free parameters of the fit are: 
the Higgs boson yield, the background yield, two parameters (/i and k) each for the signal 
and background log normal distributions, and the slope of the polynomial describing the 
background mass distribution. The position and width of the Higgs boson peak are fixed. 

3.2 Sensitivity Analysis 

The sensitivity study is based on a two-dimensional extended unbinned maximum likeli¬ 
hood fit to the diphoton invariant mass and p± distributions using the probability density 
functions described above. The Hq hypothesis corresponds to the scenario of loop-induced 
Higgs boson production plus the diphoton background. The H\ hypothesis describes the 
point-like Higgs boson production together with the diphoton background. 

Toy data are generated for the two hypotheses, where the event numbers in the toy 
datasets depend on the integrated luminosity studied. In Fig. 5 an example toy data set is 
shown. In the high-p_|_ region and for a luminosity of £ = 1000 fb _1 the expected background 
yield is Nb = 1360, while the signal components for the respective hypotheses are Ng° = 86 
and Ng 1 = 150 (see below for a discussion of the systematic uncertainties). From the 
negative log-likelihood ratio for the two hypotheses, a test statistic q = —2In [Ch 1 /C-h 0 ) 
is constructed. The background parameters (normalization and the shape of the mass 
distribution and the shape of the p± distribution) are treated as nuisance parameters and 
profiled over. The expected separation between Hq and H\ is quantified as follows [38-40]: 
We determine a midpoint q between the medians of the two test statistic distributions such 
that V(q > q\Hi) = V(q < q\Ho). This tail probability is converted into a significance Z 
using the one-sided Gaussian tail convention. For the separation power we quote Z = 2 Z, 
as the midpoint q is half-way between the medians of the two test statistic distributions. 
Using 10 000 toy data sets a relative sensitivity uncertainty of about 1% is achieved. To 
study systematic uncertainties, different toy data sets are generated corresponding to the 
uncertainty under study. 

4 Results 

In Fig. 6 the result of the sensitivity study is shown. The central expectation corresponds to 
the expected signal and background numbers as discussed above. Systematic uncertainties 
on the expected sensitivity arise from various sources. 

Experimental uncertainties have a significant impact as this sensitivity study ventures 
into a pile-up regime where adequate algorithms are still to be developed. For the signal 
efficiency we assume a relative uncertainty of ±20%. As no diphoton measurement exists for 
the kinematic range considered in this analysis, and since no jet to photon misidentihcation 
is considered, an uncertainty of ±40% is used for the background yield. The sampling 
uncertainty from the limited statistics in the toy data sets is minor for the sensitivity 
regime found here. 
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Background plus Higgs (loop-induced) 


Background plus Higgs (loop-induced) 




Background plus Higgs (point-like) Background plus Higgs (point-like) 




Figure 5. Example toy data set with overlayed projections of the extended unbinned maximum 
likelihood fits for the diphoton invariant mass (left) and the diphoton p± (right). The Hq hypothesis 
(loop-induced Higgs boson production) is shown in the top row, the H\ hypothesis (point-like Higgs 
boson production) is shown in the bottom row. The background component is identical for both 
hypotheses. The (black) solid curves shows the signal+background fit, the (red) dashed line shows 
the background component. The error bars show the statistical uncertainty only. 


Theoretical uncertainties have a larger impact on the expected sensitivity. The scale 
uncertainties from and pf affect both the event yields for the two hypothesis and (to 
a smaller extent) the shape of the p± distributions. The missing top mass effects are 
parametrized with 


Ax 


fp ± -40GeV 
V 100 GeV 


2 

x 1.5%, 
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where x = da/dp±. This parametrization has been adjusted to the known systematic top 
mass effects obtained in Ref. [7]. Uncertainties from a variation of the PDFs or the top 
quark mass are found to be negligible. 
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Figure 6. Expected sensitivity vs. integrated luminosity. The experimental uncertainties (left) are 
combined quadratically, the theoretical uncertainties (right) are added linearly. 

The two Higgs boson hypotheses can be separated with 2 standard deviations (cr) with 
an integrated luminosity of about 500 fb -1 . With a luminosity of 1000 fb -1 the expected 
separation sensitivity is about 3cr. The largest experimental uncertainty (downwards) arises 
from the background uncertainty in the high-p_|_ region. A combination with other Higgs 
boson final states and a more elaborate analysis would help to improve the sensitivity. 
However, theoretical uncertainties and missing top mass effects also have a large impact 
on the sensitivity as can be inferred from Fig. 6 (right panel). A reduction requires the 
full implementation of higher-order effects on the transverse-momentum distribution in the 
generators and in particular the full top mass effects at NLO that have not been calculated 
so far. 


5 Conclusions 

We have studied the possibility to separate in gluon fusion loop-induced Higgs boson pro¬ 
duction from point-like production. Using the Higgs boson yields (normalized to the overall 
rate) and the shape of the Higgs boson p± distribution the two hypotheses can be separated 
with 2 cr with an integrated luminosity of about 500fb -1 (likely after Run-2 of the LHC). 
The largest systematic uncertainty affecting this estimate is the background event yield, as 
that strongly dilutes the difference between the two Higgs boson hypotheses. Understanding 
and mitigating the impact from pile up will be crucial for this analysis. 
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